NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Machine learning materials properties with accurate predictions, uncertainty estimates, domain guidance, and persistent online accessibility

https://doi.org/10.1088/2632-2153/ad95db

Jacobs, Ryan; Schultz, Lane_E; Scourtas, Aristana; Schmidt, KJ; Price-Skelly, Owen; Engler, Will; Foster, Ian; Blaiszik, Ben; Voyles, Paul_M; Morgan, Dane (December 2024, Machine Learning: Science and Technology)

Abstract One compelling vision of the future of materials discovery and design involves the use of machine learning (ML) models to predict materials properties and then rapidly find materials tailored for specific applications. However, realizing this vision requires both providing detailed uncertainty quantification (model prediction errors and domain of applicability) and making models readily usable. At present, it is common practice in the community to assess ML model performance only in terms of prediction accuracy (e.g. mean absolute error), while neglecting detailed uncertainty quantification and robust model accessibility and usability. Here, we demonstrate a practical method for realizing both uncertainty and accessibility features with a large set of models. We develop random forest ML models for 33 materials properties spanning an array of data sources (computational and experimental) and property types (electrical, mechanical, thermodynamic, etc). All models have calibrated ensemble error bars to quantify prediction uncertainty and domain of applicability guidance enabled by kernel-density-estimate-based feature distance measures. All data and models are publicly hosted on the Garden-AI infrastructure, which provides an easy-to-use, persistent interface for model dissemination that permits models to be invoked with only a few lines of Python code. We demonstrate the power of this approach by using our models to conduct a fully ML-based materials discovery exercise to search for new stable, highly active perovskite oxide catalyst materials.
more » « less
A Practical Guide to Machine Learning Interatomic Potentials – Status and Future

Jacobs, Ryan; Morgan, Dane; Attarian, Siamak; Meng, Jun; Shen, Chen; Wu, Zhenghao; Xie, Clare; Yang, Julia H; Artrith, Nongnuch; Blaiszik, Ben; et al (January 2025, Current opinion in solid state materials science)

The rapid development and large body of literature on machine learning potentials (MLPs) can make it difficult to know how to proceed for researchers who are not experts but wish to use these tools. The spirit of this review is to help such researchers by serving as a practical, accessible guide to the state-of-the-art in MLPs. This review paper covers a broad range of topics related to MLPs, including (i) central aspects of how and why MLPs are enablers of many exciting advancements in molecular modeling, (ii) the main underpinnings of different types of MLPs, including their basic structure and formalism, (iii) the potentially transformative impact of universal MLPs for both organic and inorganic systems, including an overview of the most recent advances, capabilities, downsides, and potential applications of this nascent class of MLPs, (iv) a practical guide for estimating and understanding the execution speed of MLPs, including guidance for users based on hardware availability, type of MLP used, and prospective simulation size and time, (v) a manual for what MLP a user should choose for a given application by considering hardware resources, speed requirements, energy and force accuracy requirements, as well as guidance for choosing pre-trained potentials or fitting a new potential from scratch, (vi) discussion around MLP infrastructure, including sources of training data, pre-trained potentials, and hardware resources for training, (vii) summary of some key limitations of present MLPs and current approaches to mitigate such limitations, including methods of including long-range interactions, handling magnetic systems, and treatment of excited states, and finally (viii) we finish with some more speculative thoughts on what the future holds for the development and application of MLPs over the next 3-10+ years.
more » « less
Full Text Available
A practical guide to machine learning interatomic potentials – Status and future

https://doi.org/10.1016/j.cossms.2025.101214

Jacobs, Ryan; Morgan, Dane; Attarian, Siamak; Meng, Jun; Shen, Chen; Wu, Zhenghao; Xie, Clare Yijia; Yang, Julia H; Artrith, Nongnuch; Blaiszik, Ben; et al (March 2025, Current Opinion in Solid State and Materials Science)

The rapid development and large body of literature on machine learning interatomic potentials (MLIPs) can make it difficult to know how to proceed for researchers who are not experts but wish to use these tools. The spirit of this review is to help such researchers by serving as a practical, accessible guide to the state-of-the-art in MLIPs. This review paper covers a broad range of topics related to MLIPs, including (i) central aspects of how and why MLIPs are enablers of many exciting advancements in molecular modeling, (ii) the main underpinnings of different types of MLIPs, including their basic structure and formalism, (iii) the potentially transformative impact of universal MLIPs for both organic and inorganic systems, including an overview of the most recent advances, capabilities, downsides, and potential applications of this nascent class of MLIPs, (iv) a practical guide for estimating and understanding the execution speed of MLIPs, including guidance for users based on hardware availability, type of MLIP used, and prospective simulation size and time, (v) a manual for what MLIP a user should choose for a given application by considering hardware resources, speed requirements, energy and force accuracy requirements, as well as guidance for choosing pre-trained potentials or fitting a new potential from scratch, (vi) discussion around MLIP infrastructure, including sources of training data, pre-trained potentials, and hardware resources for training, (vii) summary of some key limitations of present MLIPs and current approaches to mitigate such limitations, including methods of including long-range interactions, handling magnetic systems, and treatment of excited states, and finally (viii) we finish with some more speculative thoughts on what the future holds for the development and application of MLIPs over the next 3–10+ years.
more » « less
Free, publicly-accessible full text available March 1, 2026
Foundry-ML - Software and Services to Simplify Accessto Machine Learning Datasets in Materials Science

https://doi.org/10.21105/joss.05467

Schmidt, KJ; Scourtas, Aristana; Ward, Logan; Wangen, Steve; Schwarting, Marcus; Darling, Isaac; Truelove, Ethan; Ambadkar, Aadit; Bose, Ribhav; Katok, Zoa; et al (January 2024, Journal of Open Source Software)

Full Text Available
Infrastructure for Analysis of Large Microscopy and Microanalysis Data Sets

https://doi.org/10.1017/s1431927622011539

Wei, Jingrui; Francis, Carter; Morgan, Dane; Schmidt, KJ; Scourtas, Aristana; Foster, Ian; Blaiszik, Ben; Voyles, Paul M (August 2022, Microscopy and Microanalysis)

Full Text Available

Search for: All records